Policies that Generalize: Solving Many Planning Problems with the Same Policy
Authors
Abstract
We establish conditions under which memoryless policies and finite-state controllers that solve one partially observable non-deterministic problem (PONDP) generalize to other problems; namely, problems that have a similar structure and share the same action and observation space. This is relevant to generalized planning, where plans that work for many problems are sought, and to transfer learning, where knowledge gained in the solution of one problem is to be used on related problems. We use a logical setting where uncertainty is represented by sets of states and the goal is to be achieved with certainty. While this gives us crisp notions of solution policies and generalization, the account also applies to probabilistic PONDPs, i.e., Goal POMDPs.
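As a rough illustration of what it means for one memoryless policy to solve many problems, the sketch below checks a single observation-to-action mapping against a family of corridor problems of different lengths. The `Corridor` class, its observation and action names, and the `solves` helper are all hypothetical and are not taken from the paper; uncertainty is modeled with sets of states, and the goal must be reached on every possible execution.

```python
class Corridor:
    """Hypothetical PONDP: cells 0..n-1, goal at cell n-1, start cell unknown."""

    def __init__(self, n):
        self.n = n
        self.initial = set(range(n))  # uncertainty as a set of possible states

    def is_goal(self, s):
        return s == self.n - 1

    def observe(self, s):
        # The agent only senses whether it is at the goal.
        return "at_goal" if self.is_goal(s) else "not_at_goal"

    def successors(self, s, action):
        # Non-deterministic effect: "right" advances one or two cells.
        if action == "right":
            return {min(s + 1, self.n - 1), min(s + 2, self.n - 1)}
        return {s}  # "stay"


def solves(policy, problem):
    """True if every execution of the memoryless policy reaches the goal."""
    frontier = set(problem.initial)
    # With n states, any execution still short of the goal after n steps
    # must have revisited a state, so the goal is not reached with certainty.
    for _ in range(problem.n + 1):
        frontier = {s for s in frontier if not problem.is_goal(s)}
        if not frontier:
            return True
        frontier = {
            t
            for s in frontier
            for t in problem.successors(s, policy[problem.observe(s)])
        }
    return False


# One policy over the shared observation and action space.
go_right = {"not_at_goal": "right", "at_goal": "stay"}
never_move = {"not_at_goal": "stay", "at_goal": "stay"}

print(all(solves(go_right, Corridor(n)) for n in (1, 2, 5, 50)))
print(solves(never_move, Corridor(3)))
```

The same `go_right` mapping works for every corridor length because all instances share observations and actions and differ only in size; this is only a toy instance of the kind of generalization the abstract describes.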
Similar resources
An Improvement in WRP Block Replacement Policy with Reviewing and Solving its Problems
One of the most important items for better file system performance is efficient buffering of disk blocks in main memory. Efficient buffering helps to reduce the wide speed gap between main memory and hard disks. In this buffering system, the block replacement policy is one of the most important design decisions that determines which disk block should be replaced when the buffer is full. To o...
Issues with Language Policy and Planning in Iranian Higher Education
In this study, we attempt to bring to light various organisational and implementational clashes relevant to the conceptualisation of language policies at national level, and the planning of local practices with regard to degree programmes, language journals and conferences in Iranian higher education. We also prove that in its current status, the ELT syllabus in Iran, both at national and local...
A Comparative Review on National Alcohol Prevention Policies in Different Selected Countries
Alcohol, with its impact on both communicable and non-communicable diseases, is considered the third global public health priority. Alcohol ranked third among causes of ill health and premature death, and ranked second in terms of cost among all the substances of abuse, after tobacco, even though nearly half the world’s population drinks alcohol. In most countries, where alcohol is considere...
Approximate Policy Iteration with a Policy Language Bias: Solving Relational Markov Decision Processes
We study an approach to policy selection for large relational Markov Decision Processes (MDPs). We consider a variant of approximate policy iteration (API) that replaces the usual value-function learning step with a learning step in policy space. This is advantageous in domains where good policies are easier to represent and learn than the corresponding value functions, which is often the case ...
Publication date: 2015